Statistical theory of ideal quantum measurement processes 
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To study ideal measurement processes involving a tested system S coupled to an apparatus A, we 
rely on a minimalist, statistical formulation of quantum mechanics, where states encode properties of 
ensembles. The required final state of S+A is shown to have a Gibbsian thermodynamic equilibrium 
form, not only for a large ensemble of runs, but also for arbitrary subensembles. This outcome is 
justified dynamically in quantum statistical mechanics as result of relaxation for models with suitably 
chosen interactions within A. The known quantum ambiguity which hinders the identification of 
physical subensembles is overcome owing to a specific relaxation process. The structure of the 
states describing the subsets of runs provides a frequency interpretation of Born's rule and explains 
the uniqueness of the result for each individual run, the so-called measurement problem. 
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One of the main foundational challenges of quantum 
theory is the so-called measurement problem: Why does 
each individual run of an ideal measurement yield a well- 
defined outcome, in spite of the existence of quantum 
coherences? Does measurement theory require a specific 
principle or interpretation of quantum mechanics? Al- 
ready raised by the founding fathers, this crucial question 
has witnessed a revival [1-4] . Many authors explore mod- 
els, others propose alternative interpretations of quantum 
mechanics or go beyond it. We shall, however, approach 
the problem by just treating the tested system S cou- 
pled to the measurement apparatus A as a single iso- 
lated quantum system S+A, and analysing the dynamics 
of the measurement process in a straightforward way. A 
key point is the macroscopic size of the apparatus, which 
forces us to rely on non-equilibrium quantum statisti- 
cal mechanics. Moreover, being irreducibly probabilistic, 
quantum physics does not describe individual objects; we 
must deal with statistical ensembles, and attempt to infer 
the properties of a single run of the measurement from 
those of the subensembles of runs to which it may belong. 

Technically, we have to solve the equations of motion 
for the joint density operator V(t) of S+A. We denote by 
s = J^i s i^i the observable of S to be measured, which 
has the eigenvalues Sj and the associated eigenprojec- 
tions Ili, and by fit) = tvpj^if) and 7l{t) — trs£>(i) the 
marginal density operators of S and A, respectively. At 
the initial time t = 0, S and A are uncorrelated, S lies 
in some state f(0) and A in a metastable state 71(0), so 
that V(0) = f (0) (g> 71(0). Triggered by its coupling with 
S, A may switch towards one or another among its stable 
states 7Zi , which should have equal entropies and energies 
so as to avoid bias in the measurement. These states can 
be distinguished from one another through observation 
or registration of the pointer variable Ai, identified with 
the expectation value tr^ii^A of some collective observ- 
able A of A. The final indication A; of the pointer must 
be fully correlated with the eigenvalue Si of the tested 
observable s and with the production of the final state 
fi for S. We wish therefore to show that, for each run of 



the measurement, S+A reaches at some final time tf one 
among the states [5] 
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with Born's probability pi = (fl,;) = trsIiir(O) . Parts of 
this task have been achieved for models reviewed in [1-4], 
while a detailed solution of the Curie- Weiss model [1, 6]. 
illustrates its steps. We focus below on some key ideas 
which apply to arbitrary ideal measurements. 

1. Statistical formulation of the principles of quan- 
tum mechanics. We tackle the measurement problem 
within the statistical formulation of quantum mechan- 
ics, also called "statistical interpretation" or " ensemble 
interpretation" . It is a minimalist, formal description 
and does not prejudge an interpretation of the quantum 
oddities [7]. Physical interpretations should emerge at 
the macroscopic scale, in experimental contexts, as will 
be discussed in § 5 for ideal measurements. 

Physical quantities pertaining to an ensemble of sys- 
tems are described as observables represented by Her- 
mitean matrices in a Hilbert space. Observables are com- 
parable to random variables, but their randomness is un- 
escapable, since it arises from their non-commutation, 
not solely from a lack of knowledge or of control. 

A quantum state, whether pure or not, encompasses 
the probabilistic predictions that one may make about 
the various observables. It is characterised by a cor- 
respondence between observables O and real numbers, 
implemented as O 4 (O) = trVO by means of a Her- 
mitean, normalised and non-negative density operator D. 
Such a definition looks analogous to that of a state in 
classical statistical mechanics, encoded in a density in 
phase space. However, "quantum expectation values" 
(O), "quantum probabilities" such as (11^) and "quantum 
correlations" such as (sA) present a priori only a formal 
similarity with standard expectation values, probabilities 
and correlations; they express not only some ignorance as 
in classical physics, but also the inadequacy to represent 
physical quantities by c-numbers. 
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As a standard probability distribution, a state of a sys- 
tem gathers information about it, and hence refers, im- 
plicitly or not, to a statistical ensemble £. So a state does 
not "belong to a system" , it is not an intrinsic property: 
we should regard an individual system as an element of a 
large, real or virtual, statistical ensemble £ of systems 
produced under the same conditions. As in ordinary 
probability theory, we may update our information by 
selecting some subensemble £ su b to which the considered 
system belongs, and thus assign to it a new state. 

2. System plus apparatus in thermodynamic equilib- 
rium. We suppose that the compound system S+A is 
isolated, including in A a thermal bath or an environment 
if present. As usual for macroscopic systems, isolation of 
S+A is implemented by specifying the average energy. 
The properties of S+A are governed by the Hamiltonian 
H = Hs + Ha + -ffsA, which must have specific features. 
The part Ha associated with the macroscopic appara- 
tus should produce several thermodynamic equilibrium 
states IZi if A is decoupled from S. A typical example 
is provided by spontaneously broken discrete invariance, 
the pointer variable A, being the order parameter. 

As we wish to deal with ideal, non-demolishing mea- 
surements, the process should perturb S as little as pos- 
sible: any observable compatible with s, i.e., commut- 
ing with its eigenprojectors IT, should remain unaffected. 
The conservation of all these observables is expressed by 
the fact that H depends on S only through the projectors 
Ilj. In particular the coupling must have the form 



H SA = ^2ili®hi, (2) 

i 

where hi are operators of A. Moreover, if s takes the 
value Si, that is, IT the value 1, A should lie near its 
stable state Hi, the pointer variable being close to A^. 
This can be achieved if each hi depends mainly on the 
pointer observable A and behaves as a source that breaks 
explicitly the invariance by energetically favouring A; t . 

In this section, we disregard the dynamics of the mea- 
surement process and determine the general form 2? cq for 
the thermodynamic equilibrium states of the compound 
system S+A. To this end we employ the maximum en- 
tropy method, a basic criterion for assigning a state to a 
system in equilibrium [8]. We thus maximise von Neu- 
mann's entropy — trPlnP under constraints accounting 
for the conserved quantities, which are here the energy 
(H) and the expectation values of the set of all observ- 
ables of S commuting with the projections IT. 

Associating Lagrange multipliers with all such expec- 
tation values, we find for the equilibrium states a Gibb- 
sian form where the exponent involves an arbitrary linear 
combination of these conserved observables, i.e., V eq oc 
exp(— /3H + J2i Vi) where jji is any operator of S acting 
inside the diagonal block % (so that jji = fljj/jfij). Noting 
that the exponent, which commutes with the projections 



IT, has a block diagonal structure, we find for these equi- 
librium states the structure 

i i 

Each factor qiXi, which arises from exp(— /3_ffg + jji), 
is an arbitrary block diagonal operator of S, where 
&i = IliXitli, tisXi — 1 can be interpreted as a den- 
sity operator of S. (If the eigenvalue Sj is non degener- 
ate, one just has Xi = Hi.) Furthermore, each factor 
oc exp[— P(Ha + hi)} in (3) can be interpreted as a 
canonical equilibrium density operator in the space of A. 
Here the invariance is explicitly broken by adding to the 
Hamiltonian of A the source term hi that arises from (2). 
This term should be sufficiently large so that JZ\ contains 
no admixture of i' ^ i for a macroscopic apparatus (oth- 
erwise different Sj's are not properly distinguished). It 
should not be too large though, so that TZ 1 } relaxes to IZi 
after Hsa is switched off. Thermodynamic equilibrium 

(3) entails a complete correlation between the eigenvalue 
Si of s and the value Ai of the pointer variable. 

Returning to a measurement process described by the 
dynamics of a statistical ensemble, consider a large set £ 
of runs for which S+A lies initially in the state X>(0) = 
r(0) ® TZ(0). The final state for the full ensemble £ is 
expected from (1) to be 

z>(*0 = y £,Pii>i = y £,Pih ® Ki. (4) 

i i 

We can identify it with a thermodynamic equilibrium 
state (3) with ^ = pi and Xi = fj, for which relaxation 
ofTZi — > 7Zi has taken place after switching off Hsa- The 
free parameters of T> eq are determined by the initial state 
2?(0) and the conservation laws as qiXi = ITf (0)IT. 

The state Vi defined in (1), expected to describe the 
subensemble £j obtained by selecting within £ the runs 
tagged by the value Ai of the pointer, is also identified 
with an equilibrium state for which all qj with j ^ i van- 
ish. For an arbitrary subset £ suh of runs having yielded 
a proportion qi of individual runs with outcomes Ai, the 
expected final state 

Z>Bub(tf) = JZ*Z>* (5) 

i 

is again a thermodynamic equilibrium state (3) with 
Xi = fi (see § 4). Thus, a dynamic confirmation of 

(4) and (5) amounts to prove that S+A relaxes from 
2?(0) to Gibbsian equilibrium, for £ and for its arbitrary 
subensembles £ su b, in an evolution governed by H. The 
proof must rely on quantum statistical dynamics and on 
adequate properties of A. It should also provide the char- 
acteristic time scales of the process. We sketch its main 
steps in sections 3 and 4. 
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3. Dynamics of S+A for the full set of runs. We first 
outline some general features of the dynamics of the en- 
semble £ of coupled systems S and A. The evolution of 
T>(t), though governed by the Liouville-von Neumann 
equation ihdT>/dt = [H,T>], is irreversible. This implies, 
as usual in statistical mechanics, the use of approxima- 
tions, which should be justified through a suitable choice 
of parameters in Ha and Hsa- For a large apparatus, we 
are permitted to disregard correlations between a macro- 
scopic number of degrees of freedom: In agreement with 
the definition of a state, we may drop information about 
unattainable observables of A. We are also allowed to for- 
get about recurrences that would occur after unreason- 
ably large times. A dynamical explanation of quantum 
measurements thus requires physically justified approxi- 
mations, based on a suitable choice of parameters in Ha 
and Hsa, in the same way as the explanation of irre- 
versibility in classical statistical mechanics [1]. 

Taking into account the properties of H, in particular 
the form (2) of Hsa and the property [Hs,f(0)] = 0, 
which ensures that the state f(t) of S is perturbed only 
by the interaction Hsa, it is easy to check that V(t) can 
be parameterised as 

D(t)=5^n i f(0)fi J -®W ij -(t) (6) 

in terms of a set Hij(t) = TZ^^t) of operators in the 
Hilbcrt space of A, to be determined by the equations 



lh ~dt = ^ A + ^ioit) - K l3 (t)(H A +hj), (7) 

with the initial conditions 7^,(0) = 7^.(0). The dynam- 
ics thus involves solely the apparatus, irrespective of the 
tested system, a specific property of ideal measurements. 

To derive dynamically (4) from (7), one has to study 
two independent processes, characterised by different 
time scales. For i ^ j, the coherent contributions TZij(t) 
should decay so as to eventually vanish. For i = j, lZu(t) 
should relax to the equilibrium state TZ^ in the presence 
of the source hi (which requires this source to be suf- 
ficiently large), and then to after Hsa is switched 
off; the required correlation between Si and Ai is thereby 
established. Both processes are essential. They resort 
to standard techniques of non-equilibrium statistical me- 
chanics, which have been applied to many models [1- 
4]. It will therefore be sufficient to acknowledge here 
this possibility of confirming the thermodynamic result 
(4) through microscopic dynamics. However, the under- 
standing of measurements requires further steps. 

4- Final states for arbitrary subensembles. Indeed, 
once the required expression (4) of the final state of S+A 
for the full ensemble £ of runs has been derived, the 
task is not yet achieved. We also need to prove that 
all the states 73 su b(t) associated with arbitrary subsets 



£sub of runs reach an equilibrium form (5). All possible 
subsets must be considered; those obtained by extracting 
at random runs from the full set £ are useless for our 
purpose, since they are described by V(t) itself. To study 
a subensemblc described by 2? su b(i), we can rely on the 
dynamical equations (7), but we now lack information 
about initial conditions. All we know is that, at a time 
tsplit taken as initial time for 2? su b, V can be split as 

f (*split) = kV suh (t spVlt ) + (1 - k)T> Cs ub(t sp iit), (8) 

where < k < 1 denotes^ the proportion of runs in 
£ su b and where the state I>Csub describes the comple- 
ment of £ S ub- However, if we conversely decompose the 
given state V as in (8), nothing tells us that there ex- 
ists a subset of runs extracted from the full ensemble 
£ which would be described by the component 2? S ub- 
This difficulty arises from a quantum ambiguity, to wit, 
a mixed state possesses many different physically incom- 
patible decompositions. As defined in section 1, a state 
characterises a correspondence O M> (O) = trVO. In 
classical probability theory, specifying likewise the ex- 
pectation value of any random variable implies the exis- 
tence of probabilities for individual events, and hence de- 
termines the probability distributions for the subensem- 
bles of events. On the contrary, in quantum theory, 
it is fallacious to believe that a term X> su b issued from 
some mathematical splitting (8) of T> has any physical 
relevance in terms of a real subensemble. As an illus- 
tration, consider an unpolarised ensemble of spins; its 
density operator equals both i(|z)(z| + |— z)(— z\) and 
\{\x){x\ + \—x)(—x\) where \z) denotes a pure state po- 
larised in the z-direction. If these two decompositions 
were meaningful, there would exist subensembles of spins 
polarised simultaneously in two orthogonal directions! 
Likewise, as any mixed state, T> possesses physically in- 
compatible decompositions, so that we cannot infer from 
its sole knowledge whether a term T> S ub of (8) is or not 
associated with one among the real subsets of runs £ su b 
in which we are interested. 

Our strategy will therefore rely on the considera- 
tion of any mathematically allowed decomposition (8) of 
D (fspiit), the first element of which being taken as ini- 
tial state 73 su b(i S piit)- Although nothing ensures that the 
operators 73 su b(£) thus defined are physically meaningful 
(and in most cases they will not), we are ascertained that 
their class will include the physical states describing all 
real subsets of runs that appear in practice. 

By studying the dynamics of T> sn \ ) (t) for t > i su b, we 
wish to prove that any admissible candidate 2? su b(i) for 
a physical state ends up in the expected form P su b(*f) = 
^j^qiVi. This result will help us to overcome the quan- 
tum ambiguity, since states of this form are compati- 
ble. It is sufficient for our purpose to take 2? su b(i S piit) = 
I^C^spiit)) (ispiit) | as being a projection on some pure 
state, since any Dsub^spiit) can be generated as a sum 
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of projections on its eigenvectors. To simplify the dis- 
cussion, we assume the eigenvalues Sj of s to be non- 
degenerate, so that fj = Ilj = \si)(si\. Owing to the 
macroscopic size of A, the relative fluctuations of energy 
in the canonical states TZi are small, and we may replace 
its canonical equilibrium states^ TZi with broken invari- 
ance by microcanonical states TZ^. The relative fluctua- 
tions of the pointer observable A around the value Ai are 
small in each state TZ^, as well as the margin in which 
the energy lies for TZ^ . We note as \Ai,rf) the kets of the 
Hilbert space of A that are involved, on the same footing, 
in As the spectrum is dense, the index 77 can take a 
very large number G of values, and TV? has the form 



(9) 



We choose the initial time t sp nt for the dynamics of 
2? su b sufficiently late so that 2}(t sp iit) has reached the 
form J2iPi^i m which S and A are correlated through 
i'i = \ s i)(si \ <8> TZ^. (Due to the quantum ambiguity, we 
cannot yet assign T>i to a subensemble of runs.) Hence 
the state 2?(i S piit) involves only matrix elements within 
the Hilbert subspace of S+A spanned by the correlated 
kets \si)<gi\Ai,ri). We can see that this property also holds 
for X> sub (i sp iit) = |*(* S piit))(*(*spiit)| as a consequence of 
Eq (8): otherwise, the operator T>(t sp nt) — fcZ^sub(^spiit) 
for some < k < 1 could not equal the non-negative 
operator (1 — fc)Pcsub- The initial state allowed by this 
constraint has therefore the correlated form 

|*(*B P iit)> = l>(i, v)\si)®\A u rj), Y)u(i, v )\ 2 = 1. (10) 

i.ri i.ri 

The coupling Hsa has been switched off before t sp nt, 
and as in § 3, Hs is ineffective. Thus, the density op- 
erator t> suh (t) = of S+A is governed for 
t > t S piit simply by the Hamiltonian Ha- For suitable 
interactions within A, Ha induces among the kets \Ai, 77) 
transitions which modify 77, keeping unchanged the en- 
ergy and the value of the pointer variable Ai (see [1] for 
examples). Then, for a large apparatus, an irreversible 
transformation takes place as in section 3, but over a 
shorter time scale T su b since no macroscopic energy trans- 
fer is involved. The mechanism is the same as the stan- 
dard microcanonical relaxation [9], but several thermo- 
dynamic equilibrium states coexist here. Two types of 
irreversible processes simultaneously occur in the contri- 
butions of A to 2? su b(t). On the one hand, the contri- 
butions from \Ai,t]){Aj,r]'\ with i ^ j fade out. On the 
other hand, for i = j, the state of A relaxes to equi- 
librium as in the standard relaxation towards a single 
microcanonical state (the terms with 77 ^ 7/ disappear 
and the populations of the kets \Ai,Tr) equalise within 
each i, producing the factor TZ^). Altogether [10], at the 



final time tf > t sp ut + r su b, £> S ub reaches, within approx- 
imations standard in statistical mechanics and justified 
for a macroscopic apparatus, the expected form (5): 



sub 



qi = J2\U(i,T))f. (11) 



Thus, provided some specific features of the appara- 
tus produce the above dynamics, the form J+ Qi^i is 
reached for any mathematically admissible initial condi- 
tion 'Dsub(ispiit), in particular by the density operators 
£>sub(*f) °f S+A that describe the outcome of any phys- 
ical subensemble £ su b of runs. The coefficients qt of the 
various subensembles are related to one another by a hi- 
erarchic structure: If two subensembles S^ h and of 
E, described by t>^ h and and having AA 1 ) and 

A/"*- 2 -* elements, respectively, merge into a subensemble 
£ su b, the coefficients satisfy the standard addition law 



A/-W+AA 2 ) 



A^ (1) +AA 2 > 



,(2) 



(12) 



5. Emergence of classical probabilistic interpretation. 
As stated in § 1, quantum theory deals with an individual 
system only if it is embedded in a statistical ensemble, 
possibly a gedanken ensemble, but physically consistent. 
The state T>i of S+A expected to describe the outcome 

(I) of a single run refers in fact to a specific subensemble 
Si. Within the statistical approach, physical subensem- 
bles £ su b (including Si) cannot a priori be identified, since 
they can be taken into consideration only after the time 
tspiit when V{t) has reached the form (4). However, suit- 
able dynamics of the apparatus have allowed us to derive 
all the general properties that the physical subensembles 
must satisfy, to wit, the simple form (5) of their associ- 
ated density operators and their hierarchic property (12). 

To infer results about single runs, we note that this 
hierarchic structure of subensembles is just the same as 
that of ordinary probabilities, when these are interpreted 
as relative frequencies of occurrence of individual events 

[II] . Once we are ensured that the states of all possible 
subensembles have the form (5) and obey this structure, 
we are led to complement the principles of quantum me- 
chanics with the following natural idea: We interpret the 
coefficients qi associated with a subensemble £ su b as the 
proportion of runs, extracted from the full ensemble S, 
having yielded the outcome Ai . This identification entails 
that £ can be split, as in an ordinary probabilistic pro- 
cess, into subsets Si of runs which provide the outcome 
Ai, each Si containing Born's proportion pi of runs. An 
individual run then belongs to one among the subsets 
Si, it can be assigned the state T>i, and a solution of the 
quantum measurement problem is thereby found. 

Thus, a classical interpretation of some quantum ob- 
jects emerges from a measurement process. The states 
defining the correspondence O <— > trVO have only a 
mathematical nature; the c-numbers (O) present only a 
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formal analogy with ordinary expectation values. How- 
ever, at the end of a measurement, we have just seen that 
the coefficients p, = (flj) = trsr(0)IIj can be identified 
with ordinary probabilities in the frequency interpreta- 
tion. Accordingly, (s) = J2iPi s i an< ^ = ^aJ^i-A ap- 
pear as ordinary statistical averages, and (s 2 ) — (s) 2 as 
an ordinary variance; the correlation between s and A in 
T>(tf) has a classical nature. Thus, in the same way as 
irreversibility emerges in statistical mechanics for macro- 
scopic systems from the underlying reversible microscopic 
theory, a more remarkable type of emergence occurs at 
the end of a measurement process using a macroscopic 
apparatus: Classical probabilities emerge from quantum 
theory, although the underlying "quantum probabilities" 
were non-commutative and could not a priori be regarded 
as frequencies in the absence of an experimental context. 
Thus, certain properties of S, encoded statistically in the 
initial state f (0) of the ensemble £ , are selected by the 
measurement context and revealed by the outcomes of 
the individual runs of the process. 

The occurrence of the factor TZi in the state T>i means 
that the runs are tagged by the value Ai of the pointer 
variable, which can be observed or registered at the 
macroscopic scale. By picking out the specific outcome 
Ai, one extracts from the whole ensemble £ of runs the 
subcnsemble £i that ends up in the state T>i. Two steps 



are thus necessary to go from the initial state £>(0) to 
T>i. First, the evolution of the coupled system S+A for 
the full ensemble £, governed by the Liouville-von Neu- 
mann equation (with approximations justified for large 
A), leads its state from V(0) to T>(tf) expressed by (4); 
correlations are created, and information is transferred 
with some loss due to irreversibility. However, the sub- 
sequent reduction of the state from V{U) to one of its 
components T>i is not a consequence of some evolution, 
but the mere result of selecting Ai among the registered 
outcomes. This change of state, associated with the ex- 
traction from £ of the subset £ i} should be interpreted 
as an updating of information, in the same way as the 
change of an ordinary probability distribution, associ- 
ated with some gain of information. A further step is 
involved in the reduction of the state of S from f (0) to f i} 
the disregard of A after selection of Ai. 

Thus, the statistical formulation of quantum mechan- 
ics, though abstract and minimalist, which involves in 
fact no actual interpretation, is sufficient to explain all 
the features of ideal quantum measurements, and their 
classical interpretation, provided the apparatus is macro- 
scopic and satisfies adequate dynamical properties. We 
thus advocate the usage of this formulation in teaching. 
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